Statistical speech-to-speech translation with multilingual speech recognition and bilingual-chunk parsing

Authors

Bo Xu

Shuwu Zhang

Chengqing Zong

Abstract

Driven mainly by the speech community, research in speech-to-speech (S2S) translation has made steady progress over the past decade, and many approaches have been proposed. Among them, corpus-based statistical strategies have been widely studied in recent years. In corpus-based translation, rather than treating the corpus merely as a set of reference templates, more detailed or structural information should be exploited and integrated into the statistical modeling. Working within a statistical translation framework, which provides a very flexible way of integrating different kinds of prior or structural knowledge, we have conducted a series of R&D activities on S2S translation. In the most recent version, we have independently developed a prototype Chinese-English bidirectional S2S translation system, supported by multilingual speech recognition and bilingual-chunk-based statistical translation techniques, to meet the demands of Manos, a multilingual information service project for the 2008 Beijing Olympic Games. This paper introduces our work on multilingual S2S translation.

Similar resources

Automatic speech recognition framework for multilingual audio contents

Automatic speech recognition (ASR) for multilingual audio contents, such as international conference recordings and broadcast news, is addressed. For handling such contents efficiently, a simultaneous ASR is promising. Conventionally, ASR has been performed independently, namely language by language, although multilingual speech, which consists of utterances in several languages representing th...

Fast Calculation of Translation Model Score for Simultaneous Automatic Speech Recognition of Multilingual Audio Contents

This paper addresses automatic speech recognition (ASR) for multilingual audio contents, such as international conference recordings and broadcast news. For handling such contents efficiently, a simultaneous ASR is promising. Conventionally, ASR has been performed independently, namely, language by language, although multilingual speech, which consists of utterances in several languages represe...

Services to Support Use and Development of Speech Input for Multilingual Multimodal Applications for Mobile Scenarios

Speech is our most natural form of interaction. Developing speech input modalities for several languages, combining speech recognition and understanding, presents various difficulties. While automatic translators ease the translation of normal text, the adaptation of grammars for several languages is currently performed based on an ad hoc approach. In this paper, we present a novel service that...

A Trainable Approach for Multi-Lingual Speech-To-Speech Translation System

This paper presents a statistical speech-to-speech machine translation (MT) system for limited domain applications using a cascaded approach. This architecture allows for the creation of multilingual applications. In this paper, the system architecture and its components, including the speech recognition, parsing, information extraction, translation, natural language generation (NLG) and textto...

An Efficient Unified Extraction Algorithm for Bilingual Data

The paper presents a unified algorithm for aligning sentences with their translations in bilingual data. The sentence alignment problem is handled as a large-scale pattern recognition problem similar to the task of finding the word sequence that corresponds to an acoustic input signal in isolated word automatic speech recognition (ASR). The algorithm gains efficiency from related work on dynami...

Publication date: 2003
